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Abstract 

We present a model for estimating the probabilities of future earthquakes of magnitudes m > 4.95 
in Italy. The model, a slightly modified version of the one proposed for California by \Helmstetter et al\ 
[2007] and \Werner et gl] [2010a] , approximates seismicity by a spatially heterogeneous, temporally homo- 
geneous Poisson point process. The temporal, spatial and magnitude dimensions are entirely decoupled. 
Magnitudes are independently and identically distributed according to a tapered Gutenberg-Richter 
magnitude distribution. We estimated the spatial distribution of future seismicity by smoothing the 
locations of past earthquakes listed in two Italian catalogs: a short instrumental catalog and a longer 
instrumental and historical catalog. The bandwidth of the adaptive spatial kernel is estimated by op- 
timizing the predictive power of the kernel estimate of the spatial earthquake density in retrospective 
forecasts. When available and trustworthy, we used small earthquakes m > 2.95 to illuminate active fault 
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structures and likely future epicenters. By calibrating the model on two catalogs of different duration 
to create two forecasts, we intend to quantify the loss (or gain) of predictability incurred when only a 
short but recent data record is available. Both forecasts, scaled to five and ten years, were submitted to 
the Italian prospective forecasting experiment of the global CoUaboratory for the Study of Earthquake 

to 



Predictability (CSEP). An earlier forecast from the model was submitted by Helmstetter et al. 2007 



the Regional Earthquake Likelihood Model (RELM) experiment in California, and, with over half of the 
five-year experiment over, the forecast performs better than its competitors. 



1 Introduction 



In this article, we document the calibration of a previously published, time-independent model of earth- 
quake occurrences to the region of Italy. We extracted probabilities of future m > 4.95 shocks for a 
five- and ten-year period in a format suitable for prospective testing within the Italian earthquake pre- 
dictability experiment Schorlemmer et al. 2010a| . Previously, Helmstetter et al. |2007| calculated a 
probabilistic earthquake forecast for m > 4.95 for the region of California over a five year duration. The 
forecast is currently being tested within the Regional Earthquake Likelihood Model (RELM) experiment 
[_FzeM[ |2007| . After more than half of the five years over, the forecast cannot be rejected by a suite of 
tests and performs better than competing forecasts [Schorlemmer et al. 2010b| . Werner et al. 2010a| 



made small modifications to the model by Helmstetter et al. [2007| and re-calibrated it on updated data 



to generate a new earthquake forecast for California. This forecast is under test within the California 



branch of the global CoUaboratory for the Study of Earthquake Predictability (CSEP) [Jordan 2006 



Zechar et al. 2009 . To calculate future earthquake potential in Italy, we used the same model with some 



minor modifications. One modification concerns the estimation of the completeness threshold, which was 
difficult to estimate at the small spatial scales that were possible with the high quality data set available 
in California \ Wemer et al.\^010a\\H'elmstetter et aH|"2007| . Instead, we set a single magnitude threshold 
for the entire region. 

Smoothed seismicity models, such as the present one, usually do not incorporate geological or tectonic 
observations. Rather, the models are calibrated on the seismicity data available from earthquake catalogs. 
One may justifiably question the hypothesized validity that the short (decadal) periods covered by high- 
quality instrumental catalogs suffice to forecast the locations of future large earthquakes that have very 
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57 low occurrence probabilities. Even if the spatial distribution of seismicity is reasonably stable up to 

58 geological timescales, estimating this distribution from a short time window of observations is difScult. 

59 A partial solution to this problem is to estimate a predictive spatial distribution, rather than the 

60 observed spatial distribution. That is, rather than estimating the density of past earthquakes, one 

61 divides available data into separate learning and target sets to estimate a predictive density from the 

62 learning catalog that is evaluated and optimized on the target quakes. This cross-validation method 

63 we employed generates smoother forecasts than a simple kernel density estimation method because the 

64 locations of future - rather than past - earthquakes are predicted, and such locations might be in 

65 regions of little previous seismicity. Nonetheless, this is only a partial solution: \Kagan and Jackson\ 

66 1994 conjectured that the optimal predictive horizon of a forecast based on smoothed seismicity scales 

67 proportionally with the learning catalog because of spatio-temporal clustering. Thus, the forecasts by 

68 [Werner et al.\ |2010a| and \Helmstetter et aL\ [2007| , which were calculated from about 25 years of high 

69 quality Californian data, should perform well for moderate earthquakes over similar timescales, but longer 

70 periods relevant for seismic hazard estimates and building codes might require longer input data sets. On 

71 the other hand, according to Kagan and Jackson 1994 , it remains an untested hypothesis that seismicity 

72 estimates based on geological observations, i.e. the earthquake history of several thousands of years, can 

73 provide more predictive and relevant information for engineering design than high-quality low-threshold 

74 instrumental catalogs. Their argument is based on two points: first, the quality of geologic observations 

75 for seismic hazard is often low compared to the quality of modern earthquake catalogs; second, seismicity 

76 exhibits long-term spatio-temporal variations that necessitate an appropriate weighting of the information 

77 contained in observations of recent earthquakes and those that occurred in the distant past. 

78 To begin to quantify the impact of the duration of the learning catalog on the predictive power of 

79 smoothed seismicity forecasts, we calculated two forecasts of the smoothed seismicity model by calibrating 

80 the model on two catalogs of different duration. One forecast is based on the most recent thirty years of 

81 instrumental data, while the other one is calculated from a century of combined historic and instrumental 

82 data. We submitted the forecasts to the European CSEP Testing Center at ETH Zurich, to be tested 

83 and compared with competing forecasts within the framework of the Italian earthquake predictability 

84 experiment CSEP-Italy [see \Schorlemmer et al. 2010a . Comparing the two forecasts' performance might 

85 shed light on the impact of the length of the learning catalog. 

86 The article is structured as follows. In section [2] we describe the Italian earthquake catalogs from 
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which we estimated future earthquake potential in Italy. Section[3]describes the model and its calibration 
on the two data sets. We present the earthquake forecasts in section [4] before concluding in section [S] 



2 Data 



2.1 The CSI 1.1 Catalog 1981-2002 



For more details about the catalogs discussed here and below, see Schorlemmer et al. 2010a and 



references therein. As primary data source for the forecast based on smoothed instrumental seismicity, 
we used the Catalogo della Sismicita Italiana (catalog of Italian seismicity, CSI 1.1) \Castello et al.\ 
2007 Chiarabba et al. 2005| available from http://www.cseptesting.org/regions/italy The CSI 
catalog spans the time period from 1981 until 2002 and reports local magnitudes, in agreement with 
the magnitude scale that will be used during the prospective evaluation of forecasts. \Schorlemmer et al.\ 



2010a found a clear change in earthquake numbers per year in 1984 due to the numerous network changes 



in the early 1980s and recommend using the CSI data from mid 1984 onwards. We therefore selected 
earthquakes listed in the CSI catalog from 1 July 1984 until 31 December 2002 within the CSEP-Italy 
collection region defined by Schorlemmer et al. [2010a . To avoid possible contamination from quarry 
blasts and volcanic microseismicity, we set a uniform completeness magnitude threshold of mt =2.95 for 
the entire collection region, which is higher than the threshold of mt = 2.5 calculated by [Schorlemmer] 



et al. 2010a for onshore seismicity. 



2.2 The BSI Catalog 2003-2009 



For prospective tests of the submitted forecasts [see Schorlemmer et al. 2010a , the Bollettino Sis- 



mico Italiano (BSI) earthquake catalog recorded by the Istituto Nazionale di Geofisica e Vulcanolo- 
gia (INGV) \Gruppo di lavoro BSl\ |2002| \Amato et 'al] |2006| will be used. The BSI is available at 
|http : //bollettinosismico . rm . ingv . it] and since July 2007 at |http: //ISIDe . rm. ingv. it/[ We used 
earthquakes listed in the BSI from 1 January 2003 until 25 June 2009. Because the data quality of small 
quakes in the BSI catalog between 2003 and 2005 is questionable, we applied a relatively high magnitude 
threshold of mt = 2.95. For the forecast based on recent instrumental observations, we merged the BSI 
and CSI catalogs (hereinafter called the "merged instrumental catalog" or MIC). 
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2.3 The CPTI08 Catalog 1901-2006 



For the forecast based on instrumental and historical seismicity, we used the Catalogo Parametrico dei 
Terremoti Italiani (parametric catalog of Italian earthquakes, CPTI08) \Rovida and the CPTI Working 



Group 2008 available from http://www.cseptesting.org/regions/italy The CPTI08 catalog, a 



preliminary revision of the 2004 CPTI04 catalog Gruppo di lavoro GPTI 2004 , covers the period 



from 1901 until 2006 and is based on both instrumental and historical observations. The catalog lists 
moment magnitudes that were estimated either from macroseismic data or calculated using linear ("ad- 



hoc" Rovida and the CPTI Working Group 2008 ) regression relations between surface wave, body 



wave or local magnitudes Gruppo di lavoro MPS 2004 . Castellaro et al. 2006 showed that standard 



linear regression (SLR) can lead to biased and uncertain magnitude conversions because magnitude 
observations and associated errors violate the simplifying assumptions of SLR. As a remedy, | Gastellar^ 
et al. [2006 advocate the use of general orthogonal regression. Therefore, the CPTI08 catalog can be 
expected to contain biases and large uncertainties (like most historical catalogs) that might affect the 



quality of earthquake forecasts. In their global forecasts, Kagan and Jackson 2010a| therefore used 



more homogeneous catalogs: the global Centroid Moment Tensor (CMT) catalog Ekstrom et al. 2005 



and the global Preliminary Determination of Epicenters (PDE) catalog by the U.S. Geological Survey 



U.S. Geological Survey 2001 . Nonetheless, the CPTI08 catalog offers a much longer training catalog 



(a century) than either of these two global catalogs (30 to 40 years), allowing us to investigate the effect 
of the length of the training catalog on the generated forecasts. Therefore, we accepted the potential 
shortcomings of the CPTI08 catalog for this study. Because the prospective experiment will use local 
magnitudes, we converted the moment magnitudes to local magnitudes using the same regression equation 
that was used to convert the original local magnitudes to moment magnitudes for the creation of the 



CPTI catalog Gruppo di lavoro MPS 2004 Schorlemmer et al. 2010a 



mi = 1.231(mvK - 1-145) 



(1) 



As for the BSI and CSI catalogs, we only selected shocks within the collection region and with depths 
shallower than 30km. Some quakes, mostly during the early part of the CPTI catalog, were not assigned 
depths. We included these earthquakes as observations within the testing region because it is very 



unlikely that they were deeper than 30km [see also Schorlemmer et al. 2010a|. We selected earthquakes 
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with moment magnitudes mw > 4.75 \Schorlemmer et~aT. 2010a , which corresponds to local magnitudes 
rriL > 4.45. 



3 Model Calibration 



The model has previously been documented by Helmstetter et al. 2007| and Werner et al. [2010a , and 
we will only provide a brief overview here. First, earthquake catalogs were declustered to remove the 
strong influence of triggered sequences (section |3.1[ ); if we did not decluster, we would need to use a more 
complicated, time-dependent model that removes the influence of aftershocks with the Omori-Utsu law 
Omori |1894[[T7teM et al\ 1995| . Once declustered, the seismicity was smoothed with an adaptive power- 



law kernel (section |3.2[ ). The bandwidth of the kernel at each earthquake epicenter adapts to the distance 
to the fcth nearest neighbor. To estimate the optimal number of neighbors to include in the smoothing, 
we divided the catalog into two non-overlapping sets: a learning catalog and a testing catalog. In section 
3.3[ we determine the optimal number of neighbors by calculating the spatial density of seismicity from 
the learning catalog and evaluating its predictive power on the testing catalog. The spatial density 
was scaled to the number of expected earthquakes by using the mean number of observed earthquakes 
(section|3.6[). Finally, to obtain a rate-space-magnitude forecast, we multiplied the scaled spatial density 



by a tapered Gutenberg-Richter magnitude-frequency distribution (section 3.5 1. In contrast to the model 



presented by Helmstetter et al. 2007 and Werner et al. 2010a , we used a homogeneous threshold for 



the completeness magnitude because the previous method, which estimated the threshold as a function 
of space, did not perform well for Italy. 



3.1 Declustering 

To estimate the spatial distribution of spontaneous earthquakes, we used the declustering algorithm 
proposed by \Reasenberg\ [i985| , modified slightly hy \Helmstetter et aL| [2007] a,nd \Werner et oT] [2010a| . 
As in these prior studies, we set the input parameters to r/act = 8, Xk = 0.5, pi = 0.95, Tmin = 1 day and 
Tmax ~ 5 days. We varied Xmeff according to the different learning catalogs we used. As the interaction 

instead 



distance, we used the scaling r — 0.01 x 10 km suggested by Wells and Coppersmith 



1994 



of r = 0.011 X lo"'^^'^ km and r < 30 km in Reasenberg's algorithm. Like 



Werner et al. 



2010a 



, we set 



the localization errors to 1 km horizontal and 2 km vertical. We found that about 80% of earthquakes 
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167 in the merged instrumental catalog (MIC) are spontaneous, while in the CPTI catalog, about 92% of all 

168 shocks are independent according to Reasenberg's classification. 



3.2 Adaptive Kernel Smoothing of Declustered Seismicity 

We estimated the density of spontaneous seismicity in each 0.1 by 0.1 degree cell by smoothing the 
location of each earthquake i with an isotropic adaptive power-law kernel Kdi{r): 

where di is the adaptive smoothing distance and C{di) is a normalizing factor, so that the integral of 
Kdi (r) over an infinite area equals 1. 

We measured the smoothing distance di associated with an earthquake i as the horizontal distance 
between event i and its feth closest neighbor. The number of neighbors, k, is an adjustable parameter. 



estimated by optimizing the forecasts (see section 3.31. We imposed di > 0.5 km to account for location 
uncertainty. The kernel bandwith di thus decreases if the density of seismicity is high at the location fl 
of the earthquake i, so that we have higher resolution (smaller di) where the density is higher. 
The density at any point r was estimated by 

Kr-) = ^7^d.(r--rl) (3) 

i = l 

where Ni is the total number of earthquakes in the learning catalog. However, the forecasts are given as 
an expected number of events in each cell of 0.1°. We therefore integrated equation ^ over each cell 
and summed over all contributing earthquakes to obtain the seismicity rate of each cell. 



3.3 Optimizing the Spatial Smoothing 



184 We estimated the parameter k, the number of neighbors used to compute the smoothing distance di in 

185 equation |3|, by maximizing the likelihood of the model. We built the model fi'{ix,iy) in each cell {ix,iy) 

186 from the data in the learning catalog and evaluated the likelihood of target earthquakes in the testing 

187 catalog. Because we assumed independence of the spatial density from the magnitude distribution and 

188 the total expected number of events, we optimized the normalized spatial density estimate in each cell 
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189 (ix,iy) using 



190 where A^t is the number of observed target events. The expected number of events for the model ji* thus 

191 equals the observed number Nt- 

192 The log-likelihood of the model is given by 

L ^^^\ogp[^l*{i^,iy),n\ (5) 

193 where n is the number of events that occurred in cell (ix, iy)- To adhere to the rules of the CSEP-Italy 

194 predictability experiment, we assumed that the probability p of observing n events in cell {ix,iy) given 

195 a forecast of n*{ix,iy) in that cell is given by the Poisson distribution 

p[^'{ix,iy),n] = [f.*iix,iyr ^^Pt-^J^-'^'^)] (6) 

196 We built a large set of background models ^* by varying (i) the starting times, end times and magnitude 

197 thresholds of the learning and testing catalogs, and (ii) the catalog (either the MIC or the CPTI catalog). 

198 We evaluated the performance of each model by calculating its probability gain per target earthquake 

199 relative to a model with a uniform spatial density: 

«-«(^) m 

200 where Lq is the log-likelihood of the spatially homogeneous model. 

201 3.4 Results of the Spatial Optimization 

202 Tables[l]and[2]show the results of the spatial optimization on the MIC and the CPTI catalog, respectively. 

203 For each model, we found the optimal smoothing parameter k in the range [1, 50] by choosing the value 

204 that maximized the likelihood of the target earthquakes in the target catalog given the smoothed spatial 

205 density estimated from the learning catalog. We varied the magnitude threshold of the input catalog to 

206 test whether including small earthquakes results in greater predictability of future m > 4.95 earthquakes. 

207 We also changed the target periods to test the robustness of the results. 

208 In Figure[l] we show the probability gains per earthquake against the magnitude threshold of the two 
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learning catalogs. For comparison, we also included the gains obtained for the five-year period 2004-2008 
(inclusive) in California by Werner et al. |2010a . The gains obtained in Italy fluctuate strongly for 
different target periods, and it is difficult to detect a systematic trend in the gain as a function of the 
magnitude threshold of the learning catalog. In contrast to California, where around 25 earthquakes 
m > 4.95 tend to occur per five years, Italy experiences far fewer shocks of equal size; during the 1992- 
1996 period in the CPTI catalog, gains were calculated from only two target earthquakes. The small 
sample size of target earthquakes might explain the observed fluctuations in the calculated gains (see 
also below and section [5|. 

For the target period 1994-1998 of the MIC, the gains are especially small for low thresholds. For 
mt = 2.95, the smoothing parameter reached the maximum value 50 (model 7 in Table [TJ, realizing a 
gain smaller than unity, i.e. the uniform forecast outperforms the smoothed seismicity forecast (further 
increasing the amount of smoothing eventually leads to a uniform density, such that the gain would equal 
unity). The long-range smoothing required by the target events can be traced back to the occurrence 
of three earthquakes in 2002: the 6 September Sicily earthquake north-east of Palermo, the 31 October 
Molise earthquake and one of its aftershocks. The predicted densities in the relevant cells is increased 
by a factor of almost ten as the algorithm increases the smoothing from A: = 1 to fc = 50 (see also the 
discussion in section |5|. 

Whenever target earthquakes occur in previously active regions, the optimal amount of smoothing is 
small {k — 1), and the gains tend to be higher (see, e.g., model 13 in Table [l] for the 1994-1998 target 
period that includes the 1997 Colfiorito earthquake sequence). However, exceptions exist to the expected 
anti-correlation between k and G: the 15 target earthquakes during the 1997-2001 target period of the 
CPTI catalog were forecast best with a smoothing parameter fc = 15 yet realize a gain per earthquake 
of G = 3.41. 

To calculate the spatial densities for the final forecasts for the predictability experiment (model 20' 
in Table [l] obtained from the MIC and model 14' in Table [2] obtained from the CPTI catalog), we had to 
decide which magnitude threshold to apply to the learning catalog, which smoothing parameter to use, 
and whether to use all existing data up until the end of the two catalogs. We decided to use all available 
data in each catalog for the final density estimate so that the forecasts could benefit from as much 
data as possible. Moreover, despite the observed variability in gains against the magnitude threshold of 
the input catalog (discussed above), Figure [T] shows that, on average, there seems to be an advantage 
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in including small earthquakes for estimating the predictive spatial density (see also the discussion in 
section [sj. Therefore, to calculate the spatial densities for the final forecasts, we used the lowest reliable 
magnitude threshold for each catalog. Lastly, we decided to use an optimal smoothing parameter of 
k — 6, despite the large variability across magnitude thresholds and target periods, because the resulting 
density is slightly smoother than the one obtained from the median (k = 5) of the optimal values for 



the lowest magnitude thresholds and because both Werner et al. (2010a and Helmstetter et al. 12007 



used the same value. The two final predictive spatial densities based on the MIC and the CPTI catalog 
are model 20' in Table [l] and model 14' in Table |2j respectively. We discuss future improvements of the 
spatial optimization method in section [5] 



3.5 Magnitude Distribution 

We assumed that the cumulative magnitude probability distribution follows a tapered Gutenberg-Richter 
magnitude frequency distribution \Gutenherg and Richter 1944 with a uniform b-value and corner 



magnitude nic [Helmstetter et al. 2007 Eq. (10)] 



P(m) = 10 



exp 10 



^1.5(m„ 



10 



1.5(m — mc) 



(8) 



with a minimum target magnitude m„ 



■ 4.95 (for the five- and ten-year CSEP forecast group). Bird 



and Kagan 2004 p. 2393] classified the tectonic setting of Italy as an orogen situated at a continental 
convergent boundary and estimated rric = 8.461q 39- Kagan et al. 



2010 



Figure 1, Table 1] assigned 



onshore Italy to the category of "active continent" with nic = 7.59^0 25 and Italy's southern off-shore 
region to "trench" with rric = 8.571q35. For simplicity, we set a uniform value of rric = 8.0 to reflect 
these studies. This is likely to be a conservative choice for the most seismically active region of onshore 
Italy. We further used a b-value equal to one (a maximum likelihood estimate based on magnitudes 
above mt — 2.95 in the MIC resulted in 6 = 1.07). We integrated the magnitude distribution ^ in 
discrete bins of width 0.1 to conform to the rules of the experiment. 
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3.6 Expected Number of Events 



262 The expected number of events per year in each space-magnitude bin {ix,iy,im) was calculated from 

E {ix,iy,im) ^ >^ fJ-*{ix,iy) P{im.) (9) 

263 where /i* is the normalized spatial background density; P{im) the integrated probability of an earthquake 

264 in magnitude bin {im) defined according to equation Q and A is the expected number of earthquakes 

265 over a five- or ten-year period. To estimate the expected number of earthquakes, we counted the total 

266 number of observed m > 4.95 earthquakes in each (non-declustered) catalog and divided by the duration 

267 to obtain the mean number of events m > 4.95 per year. For the MIC, we estimated A'"*"^ = 1.24 per 

268 annum, while there are an average of A'^'''' = 1.72 earthquakes m > 4.95 per year in the CPTI catalog. 

269 To obtain five- and ten-year forecasts, we simply multiplied A by the number of years. Thus, based on 

270 the shorter MIC, we expect 6.2 (12.4) earthquakes from 1 January 2010 until 31 December 2014 (until 

271 31 December 2019), while based on the longer CPTI catalog, we predict 8.6 (17.2) earthquakes over the 

272 same periods. 

273 4 Five- and Ten- Year m > 4.95 Forecasts 

274 The five-year forecasts based on the merged instrumental catalog (MIC) and the CPTI catalog are shown 

275 in Figures [2] and |3] respectively. To obtain the ten- year forecasts, we doubled the rate in each space- 

276 magnitude bin because we assumed a temporally homogeneous Poisson process. The forecast based on 

277 the CPTI catalog is smoother than the map based on the shorter MIC because the 3,522 earthquakes 

278 of mostly small magnitudes m > 2.95 in the MIC cluster more strongly than the 623 events of larger 

279 magnitudes m > 4.45 in the CPTI catalog. The more evenly distributed epicenters of the CPTI catalog 

280 are more uncertain than those from the MIC, resulting in less clustering. 

281 We can compare the forecasts in Figures [2] and |3] with those of Kagan and Jackson [2010a Figure 

282 4] and Zechar and Jordan 2010 Figure 2]. \Kagan and Jackson 2010a used a fixed-bandwidth power- 

283 law kernel with an optimized bandwidth Vs = 5 km to smooth seismicity in Italy listed in the PDE 

284 catalog above a threshold rrit = 4.7. By visual inspection, their forecast is similar to the forecast based 

285 on the CPTI catalog in Figure [3] although their fixed bandwidth of = 5 km is much smaller than 
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the average of our optimal adaptive bandwidths (di) — 34.1 (see model 14' inTable[2]). The optimal 



smoothing bandwidths are different because Kagan and Jackson [2010a optimized their bandwidth for 



the 2004-2006 target period and smoothed earthquakes from a different data source. Given the observed 
dependence of the optimal smoothing distance on the chosen target period, we should expect to see 
differences in the optimal bandwidths. 



Zechar and Jordan 2010 smoothed the CSl, the CPTI and a merged ("hybrid") catalog with an 



optimized fixed-bandwidth Gaussian kernel. Again, different choices for the magnitude threshold and 
the learning data make a direct comparison difficult, except for the forecasts based on the CPTI catalog 



optimized for the 2002-2006 target period. Figure 1 of Zechar and Jordan 12010 shows that the optimal 



smoothing lengthscale is a = 75 km, while we obtained a mean bandwidth of (di) ~ 95.6 (model 1 in 
Table [2|, indicating broad agreement between the two methods. Neither Kagan and Jackson 2010a nor 



\Zechar and Jordan] 120101 investigated whether the optimal smoothing length scale varies with target 



periods. 



5 Discussion and Conclusions 



300 \Werner et al.\ |2010b| evaluated all time-independent five- and ten-year forecasts of the CSEP-Italy 

301 experiment retrospectively on data from the CSI and CPTI catalogs. Using the forecasts from the first 

302 round of submissions from 1 August 2009, they found that several of the modelers had committed errors 

303 in the calibration of their models to calculate forecasts, principally in the conversion of the moment 

304 magnitude scale of the CPTI catalog to the local magnitude scale that will be used for prospective 

305 testing. Our first submission of the forecast based on the CPTI catalog also contained an error because 

306 of a mistake in the conversion formula we used (see equation [l]) . In this article, we only discussed the 

307 corrected forecasts submitted during the second round (1 January 2010). 

308 In the future, we would like to make a number of improvements to the model. First, we used a 

309 relatively arbitrary declustering procedure based on Reasenberg's algorithm Reasenberg 1985 , although 

310 more objective methods exist [e.g. .^/iwong aZ. 2002 Console et al. 2010| . Second, contrary to the work 

311 by Helmstetter et al. 2007 and Werner et al. 2010a , we could not estimate the completeness threshold 

312 as a function of space using their method and then attempt to correct for missing small earthquakes 

313 because the results were unstable. In the future, we would use a more robust method for estimating the 
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completeness threshold. Third, we found that the optimal smoothing parameter varied substantially for 



different target periods, much more so than observed by Werner et al. 2010a . The small number of 



target earthquakes might have caused the fluctuations. In the future, the optimal smoothing parameter 
should be optimized jointly over many target periods. More generally, we'd like to assess the influence 



of the choice of the kernel function. For example, do anisotropic kernels [e.g., Kagan and Jackson 1994| 



improve the spatial forecasts? Does the optimal kernel choice depend on tectonic regime [e.g., Kagan and 



Jackson 2010a|b ? Should large earthquakes count more towards the density than small earthquakes 



[e.g., Kagan et al. 2007 



Smoothed seismicity models make the implicit assumption that the available earthquake catalogs are 
long enough to estimate predictive spatial densities. Kagan and Jackson [1994 , however, conjectured 
that the optimal forecast horizon of an earthquake forecast based on smoothed seismicity scales with 
the duration of the learning catalog. To begin to address this question, we provided two earthquake 
forecasts: one based on a relatively short (30 years) data set of lower magnitude threshold and the other 



based on a longer (100 years) catalog with higher magnitude threshold. If the conjecture by Kagan and 



Jackson 1994 is correct, the forecast based on the merged instrumental catalog (MIC) should perform 



better than the forecast based on the CPTI catalog over shorter periods, while the CPTI-based forecast 
should show more predictive skill at longer timescales. 



Helmstetter et al. 12007 and Werner et al. 2010a had previously found evidence for the hypothesis 



that including the locations of small earthquakes improves the forecasts of future epicenters of m > 4.95 
earthquakes in California [see also |/fanfa[ |1992[ |Marsara| |2005| \Helmstetter et aL\ |2005| \Sornette awd| 
I Werner] |2005a|b) for perspectives on the importance of small quakes]. The results of this study do not 
provide conclusive evidence for or against this hypothesis, perhaps because of the fluctuations induced 
by the small number of target earthquakes. A more robust cross-validation method for the optimal 
smoothing parameter might address this outstanding issue. 

The CSEP-Italy experiment, like its predecessor RELM, requires the use of the Poisson distribution 



for the number of earthquakes per space-magnitude bin Schorlemmer et al. 2010a 2007 . In principle 



however, each model should provide its own model-dependent uncertainty bounds Werner and Somette 



2008 . For the time-independent Poisson process model described here, the distribution of the number 



of shocks over the relevant flve- or ten-year timescales is assumed to be time-independent. However, 
the negative binomial distribution fits the number distribution better than the Poisson distribution 
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Schorlemmer et al. 2010b Kagan 2010 Werner et al. 2010a|b| . Therefore, in future iterations of the 
model, the Poisson distributions should be replaced by appropriate alternatives in each space-magnitude 
bin. A difficulty with this approach will be the estimation of parameter values based on small and 
possibly correlated samples. 

Nonetheless, despite the model's simplicity and its approximations, its five-year forecast submitted 



by Helmstetter et al. 2007 to the RELM experiment outperforms competitors after the first 2.5 years 



Schorlemmer et al. 2010b . Whether the model can perform similarly in Italy, a different tectonic region 



from the seismically much more active California, will be an interesting test of the universal applicability 
of the model's assumptions. 



Data and Sharing Resources 

We used three earthquake catalogs for this study: the Catalogo Parametrico dei Terremoti Italiani 



(Parametric Catalog of Italian Earthquakes, CPTI08) Rovida and the CPTI Working Group 2008], the 



Catalogo del la Sismtcita Italiana (Catalog of Italian Seismicity, CSI 1.1) \Castello et al. 2007 Chiarabba 
\et al. \ [2005| , and the Bollettmo Sismico Italiano (Italian Seismic Bulletin, BSI) \Gruppo di lavoro BSI\ 
|2002[ \ Amato et al. \ |2006| . The BSI is available at |http: //bollettinosismico . rm . ingv . it] and since 
July 2007 at |http: //ISIDe . rm. ingv. it/[ The particular versions of the CSI and CPTI catalogs we used 
are available at [http: //www. cseptesting. org/regions/ italy[ 
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Table 1: Results of the optimization of the spatial density estimate using the merged instrumental catalog (MIC) from 1 July 
1984 through 25 June 2009. We varied the learning and target catalogs. The target catalog is the MIC in the testing 
region. The input catalog is the dechistercd MIC in the collection region. JV; and Nt arc the number of earthquakes 
in the learning and testing catalog, respectively, L is the log-likelihood score of a model, G is a model's probability 
gain per earthquake over a spatially uniform model, k is the optimal number of neighbors to include in the bandwidth 
of the smoothing kernel, and (di) is the mean adaptive bandwidth in km. 
' indicates that k was not optimized but constrained to k = 6. 
* denotes that the maximum fc = 50 of the range [1, 50] was attained. 
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Table 2: Same as Table [T] but using the CPTI catalog from 1901 to 2006 as data set. 
' indicates that k was not optimized but constrained. 



20 



Figure 1: Probability gain per earthquake against magnitude threshold of the learning catalogs for various five-year target 
periods: blue - merged instrumental catalog (MIC); red — CPTI catalog; Cal. 2004-2008 — the gains obtained for 
California by I Werner et ai.||2010a|; homogeneous - the reference gain of a spatially homogeneous forecast. 



Figure 2: Earthquake forecast based on the merged instrumental catalog (MIC): Expected number of earthquakes mi^ > 4.95 
over the five-year period from 1 January 2010 until 31 December 2014 per (0.1°)^ based on smoothing the locations 
of earthquakes rrii^ > 2.95 in the instrumental catalog from 1 July 1984 until 25 June 2009. 



Figure 3: Earthquake forecast based on the CPTI catalog: Expected number of earthquakes > 4.95 over the five-year 
period from 1 January 2010 until 31 December 2014 per (0.1°)^ based on smoothing the locations of earthquakes 
rriL > 4.45 in the CPTI catalog from 1901 until 2006. 
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